Evaluating two versions of the momel pitch modelling algorithm on a corpus of read speech in Korean
نویسندگان
چکیده
The Momel algorithm provides an automatic factoring of raw fundamental frequency into two components: a microprosodic component, corresponding to local variations of pitch caused by the phonetic nature of the speech segments and a macroprosodic component corresponding to the overall pitch pattern of the utterance which is then represented as a sequence of pitch targets. An earlier evaluation estimated the overall efficiency of the algorithm (F-measure) at around 95% on a corpus of read speech for 5 European languages and at around 93% for a corpus of spontaneous speech. In this paper we present the results of the evaluation of the output of two versions of the Momel algorithm as compared with manually corrected pitch targets for a corpus of just over 2 hours of read speech in Korean (40 continuous 5-sentence passages, each read by 5 male and 5 female speakers). The results show that the new version of the Momel algorithm performs systematically better than the earlier version.
منابع مشابه
Korean MULTEXT: A Korean Prosody Corpus
This paper describes the contents of the Korean prosody corpus (Korean MULTEXT), which is a Korean version of the speech database Eurom1. The corpus consists of about 2 hours of read speech, transcribed primarily in orthography (in Korean alphabet and in a Romanized transcription), in IPA and in SAMPA. Furthermore, it includes the original F0 values, stylized F0 values extracted using Momel, an...
متن کاملAutomatic analysis of the intonation of a tone language. applying the momel algorithm to spontaneous standard Chinese (beijing)
This paper describes the application of the Momel algorithm to a corpus of spontaneous speech in Standard (Beijing) Chinese. A selection of utterances by four speakers was analysed automatically and the resynthesised utterances were evaluated subjectively with two categories of errors: lexical tone errors and intonation errors. The target points determining the pitch contours of the synthetic u...
متن کاملAutomatic adaptation of the momel F0 stylisation algorithm to new corpora
The paper investigates the adaptability of the MoMel (Modelling of Melody) Algorithm [1] to new corpora. A detailed overview of the MoMel algorithm and its parameters are presented. The generality of the default parameter values to new corpora is studied empirically. Two of the parameters, related to window durations, are discovered to be highly corpus dependant. The paper presents a significan...
متن کاملمقایسه روشهای مختلف یادگیری ماشین در خلاصهسازی استخراجی گفتار به گفتار فارسی بدون استفاده از رونوشت
In this paper, extractive speech summarization using different machine learning algorithms was investigated. The task of Speech summarization deals with extracting important and salient segments from speech in order to access, search, extract and browse speech files easier and in a less costly manner. In this paper, a new method for speech summarization without using automatic speech recognitio...
متن کاملPitch parameters for pr A preliminary comparison o
The search for objective paradigms for establishing prosodic typologies among languages is a major challenge for speech science. Recent work in the area of speech rhythm has shown that an appropriate choice of parameters can provide revealing evidence for traditional typological classifications. In the area of pitch there has been less activity. This paper presents a preliminary comparison of p...
متن کامل